Arabic Automatic Speech Recognition: A Systematic Literature Review

نویسندگان

چکیده

Automatic Speech Recognition (ASR), also known as Speech-To-Text (STT) or computer speech recognition, has been an active field of research recently. This study aims to chart this by performing a Systematic Literature Review (SLR) give insight into the ASR studies proposed, especially for Arabic language. The purpose is highlight trends about and guide researchers with most significant published over ten years from 2011 2021. SLR attempts tackle seven specific questions related toolkits used developing evaluating ASR, supported type language, feature extraction/classification techniques, performance existing gaps facing researchers, along some future research. Across five databases, 38 met our defined inclusion criteria. Our results showed different open-source support recognition. prominent ones were KALDI, HTK, then CMU Sphinx toolkits. A total 89.47% retained cover modern standard Arabic, whereas 26.32% them dedicated dialects Arabic. MFCC HMM presented extraction classification respectively: 63% papers based on 21% HMM. review shows that systems depends mainly criteria availability resources, techniques acoustic modeling, datasets.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Syllable-based Automatic Arabic Speech Recognition

In this paper, we concentrate on the automatic recognition of Egyptian Arabic speech using syllables. Arabic spoken digits were described by showing their constructing phonemes, triphones, syllables and words. Speaker-independent hidden markov models (HMMs)-based speech recognition system was designed using Hidden markov model toolkit (HTK). The database used for both training and testing consi...

متن کامل

Automatic diacritization of Arabic transcripts for automatic speech recognition

Arabic orthography does not provide full vocalization of the text, and the reader is expected to infer short vowels from the context of the sentence. Inferring the full form of a word is useful when developing Arabic speech and language processing tools, since it is likely to reduce ambiguity in these tasks. In this paper, we present generative techniques for recovering vowels and other diacrit...

متن کامل

Automatic Speech Recognition System: A Review

Speech is the most prominent & primary mode of Communication among human beings. Now-a-days Speech also has potential of being important mode of interaction with computers. This paper gives an overview of Automatic Speech Recognition System, Classification of Speech Recognition System and also includes overview of the steps followed for developing the Speech Recognition System in stages. This p...

متن کامل

Lexical and phonetic modeling for Arabic automatic speech recognition

In this paper, we describe the use of either words or morphemes as lexical modeling units and the use of either graphemes or phonemes as phoneticmodeling units for Arabic automatic speech recognition (ASR). We designed four Arabic ASR systems: two word-based systems and two morpheme-based systems. Experimental results using these four systems show that they have comparable state-of-the-art perf...

متن کامل

An enhanced automatic speech recognition system for Arabic

Automatic speech recognition for Arabic is a very challenging task. Despite all the classical techniques for Automatic Speech Recognition (ASR), which can be efficiently applied to Arabic speech recognition, it is essential to take into consideration the language specificities to improve the system performance. In this article, we focus on Modern Standard Arabic (MSA) speech recognition. We int...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Applied sciences

سال: 2022

ISSN: ['2076-3417']

DOI: https://doi.org/10.3390/app12178898